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Deep-sea hydrothermal vent chimneys contain a high diversity of microorganisms, 
yet the metabolic activity and the ecological functions of the microbial communities 
remain largely unexplored. In this study, a metagenomic approach was applied to 
characterize the metabolic potential in a Guaymas hydrothermal vent chimney and to 
conduct comparative genomic analysis among a variety of environments with sequenced 
metagenomes. Complete clustering of functional gene categories with a comparative 
metagenomic approach showed that this Guaymas chimney metagenome was clustered 
most closely with a chimney metagenome from Juan de Fuca. All chimney samples 
were enriched with genes involved in recombination and repair, chemotaxis and flagellar 
assembly, highlighting their roles in coping with the fluctuating extreme deep-sea 
environments. A high proportion of transposases was observed in all the metagenomes 
from deep-sea chimneys, supporting the previous hypothesis that horizontal gene 
transfer may be common in the deep-sea vent chimney biosphere. In the Guaymas 
chimney metagenome, thermophilic sulfate reducing microorganisms including bacteria 
and archaea were found predominant, and genes coding for the degradation of refractory 
organic compounds such as cellulose, lipid, pullullan, as well as a few hydrocarbons 
including toluene, ethylbenzene and o-xylene were identified. Therefore, this oil-immersed 
chimney supported a thermophilic microbial community capable of oxidizing a range 
of hydrocarbons that served as electron donors for sulphate reduction under anaerobic 
conditions. 



Keywords: metagenome, deep sea, hydrothermal vent, chimney, Guaymas, biodegradation, hydrocarbon, sulfate 
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INTRODUCTION 

Deep-sea hydrothermal vents characterized by steep physico- 
chemical gradients harbor a wide range of microorganisms in 
different ecological niches, including the high-temperature chim- 
ney matrix (Reysenbach and Shock, 2002). Deep-sea hydrother- 
mal vent chimneys are the product of hydrothermal circulation 
and alteration of seawater entrained through geothermally heated 
subseafloor basalt, and subsequent precipitation of mental sul- 
fides when hot vent fluids emerge into cold sea water (Von Damm, 
1990). The geochemical disequilibria within and surrounding 
chimneys provide rich energy sources for microorganisms, as var- 
ious reduced chemicals (such as sulfur, methane, and H2) are 
utilized as potential electron donors (Jannasch and Mottl, 1985; 
Distel et al., 1988; Lam et al., 2004; Petersen et al., 2011). The 
structures of these microbial communities are shaped primar- 
ily by variation of hydrothermal fluid composition, in particular 
H2 concentrations (Flores et al., 2011). Thanks to advances in 
sequencing technologies, molecular microbial diversity studies 
of deep-sea hydrothermal environments have made significant 
progress in understanding the geographic distributions of these 
microbial communities (Teske et al., 2002; Huber et al., 2007, 



2010; Brazelton et al., 2010; Dick and Tebo, 2010; Roussel et al., 
2011). 

Despite the increased knowledge of the microbial diversity of 
deep-sea hydrothermal vents, much less was known about the 
metabolic potential and ecological functions of these commu- 
nities, especially when considering that less than 1% of envi- 
ronmental microorganisms could be cultured under laboratory 
conditions (Amann et al., 1995). Metagenomic-based meth- 
ods have provided unique opportunities to explore the features 
of microbial communities from diverse deep-sea hydrothermal 
vent environments. So far, metagenomes from two deep-sea 
hydrothermal vent chimneys have been published, from a carbon- 
ate white chimney at Lost City with relatively low temperature 
and high pH (<90°C, pH 9-11) (Brazelton and Baross, 2009), 
and from a chimney sample collected from Juan de Fuca (Xie 
et al., 2011) characterized by high temperature and low pH flu- 
ids (>300°C, pH 2-3). Both metagenomes were found enriched 
in transposases, implying that horizontal gene transfer may be 
a common feature of hydrothermal vent chimney biosphere. 
Comparative metagenomic studies with more chimney samples 
from different deep-sea hydrothermal vents should be performed 
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to reveal common features of the chimney-originated microbial 
communities. 

The Guaymas Basin (Gulf of California) is a unique hydro ther- 
mal vent site, where emitted high-temperature fluids are influ- 
enced by the presence of a thick layer (100-500 m) of sediments 
(with 2-4% organic matter, OM). These sediments were formed 
by precipitation from the highly productive surface waters and 
terrigenous input (Von Damm et al., 1985). The venting flu- 
ids are characterized by an increase of pH (around 6.0), and 
a decrease in the highest temperature of fluids emitted on the 
seafloor (270-325°C) (Von Damm et al, 1985). Therefore, the 
chimney sample from the Guaymas site would be an ideal sam- 
ple for comparative metagenomic analysis together with the two 
published ones. In addition, Guaymas Basin is unique due to 
the fact that under high-temperature conditions, the OM in its 
rapidly accumulating sediments is pyrolized to petroleum-like 
hydrocarbon products, such as aliphatic and aromatic hydrocar- 
bons, short-chain fatty acids, ammonia, and methane (Bazylinski 
et al., 1988; Welhan, 1988; Martens, 1990; Kawka and Simoneit, 
1994). The hydro thermally active sediments of the Guaymas 
Basin were reported with intensive methane oxidizing and sul- 
fate reducing activities (Jorgensen et al., 1990, 1992; Elsgaard 
et al, 1994; Weber and Jorgensen, 2002; Kallmeyer and Boetius, 
2004). Various hydrocarbon degrading microorganisms which 
are using sulfate as the electron acceptor have been isolated from 
Guaymas Basin sites and similar marine habitats (Riiter et al., 
1994; Galushko et al, 1999; Musat and Widdel, 2008; Kleindienst 
et al., 2012). Nevertheless, the metabolic potential, in particular 
for hydrocarbon degradation, of the whole microbial commu- 
nity from Guaymas vent chimneys has never been investigated. 
Therefore, comprehensive studies on biodegradation, in particu- 
lar under anaerobic conditions, still remain to be conducted to 
characterize the structure and metabolic potential of microbial 
ecosystems with capability for hydrocarbon biodegradation. 

In this study, the metagenome of an oil-immersed chimney 
in Guaymas Basin was analyzed to demonstrate the metabolic 
potential and ecological functions of the inhabited micro- 
bial community. Additionally, questions related to anaerobic 
biodegradation of hydrocarbons are addressed: do the microor- 
ganisms from this chimney community have anaerobic hydro- 
carbon degradation activities? If so, what kinds of hydrocarbons 
could be potentially degraded? Which groups of microorgan- 
isms are responsible for carrying out the degradations? Which 
processes are coupled/closely related to the degradation for elec- 
tron transfer? Are there any features shared among different 
chimney-originated samples? 

MATERIALS AND METHODS 
DNA EXTRACTION AND SEQUENCING 

The sample 4558-6 under investigation represented the outer 
layer of a black-smoker chimney with preliminary venting 
fluid temperature 190°C (measured above the chimney prior 
to sampling), and was collected in Guaymas Basin (27°0.9'N, 
111°24.6'W, depth = 2013 m) by the HUV Alvin (supported by 
the R/V Atlantis) in November, 2009. The chimney (Figure Al) 
was kept at — 20° C immediately after sample collection, and 
stored with dry ice during transportation and stored at — 80° C 



in laboratory until further analyses. The genomic DNA was 
extracted from outer sections of all collected samples where high- 
est DNA quantity was found. Isolation of DNA was conducted as 
described in a previous study (Wang et al, 2009). Metagenome 
pyrosequencing was performed according to company protocol 
on the 454 Life Sciences GS FLX system with a practical limit of 
400 bp. All the sequences were deposited in the MG-RAST server. 

METAGENOMIC ANALYSIS AND ASSEMBLY 

Low quality sequencing reads were trimmed in Geneious 6.04 
(Biomatters Ltd.) with default parameters. Technical replicates 
(Gomez-Alvarez et al, 2009) were removed with cd-hit (at 96% 
sequence identity) (Niu et al., 2010). Shorter reads (<100bp) 
were then excluded, and the remaining reads were assem- 
bled with Velvet (Zerbino and Birney, 2008). This metagenome 
from Guaymas Basin chimney was uploaded (MG-RAST ID: 
4510962.3) and analyzed using MG-RAST (Meyer et al, 2008). 

METAGENOMIC SEQUENCE ANALYSIS 

Coding regions within the metagenome were predicted using 
FragGeneScan (Rho et al., 2010), and the predicted sequence 
features were then annotated (e-value < le-5) against M5NR pro- 
tein database. 16S rRNA genes were predicted with HMM and 
BLASTN (e-value <le-5) in webMGA (Wu et al, 2011), respec- 
tively. To analyze the taxonomic contents, all predicted gene 
features were subject to blastx (Altschul et al., 1997) searches 
against NCBI non-redundant (NR) database (e-value <le-5, 
word size = 3, multi hit window size = 40 and low complex- 
ity filter on) and visualized in MEGAN (Huson et al., 2007). 
Each predicted sequence feature in the metagenome was assigned 
to a certain taxon when at least 75% of the BLAST hits of this 
query were from that specific taxon. Sequences with matches to 
the eggNOG (Powell et al, 2012), COG (Tatusov et al, 2003) 
and KEGG (Ogata et al., 1999) database were retrieved to build 
functional categories and reconstruct metabolic pathways. 

ANNOTATION OF SEQUENCES WITH DEGRADATION/METABOLIC 
ACTIVITIES 

Sequences that were annotated as enzymes in the degrada- 
tion of cellulose and fatty acids were extracted and sub- 
ject to manual examination. Annotated KEGG pathways in 
this metagenome were visualized in MEGAN to demonstrate 
the metabolic potential in the microbial community. Enzymes 
involved in the degradation of a few organic chemicals (such 
as benzene, toluene, ethylbenzene, and xylenes) were collected 
from the Biocatalysis/Biodegradation Database (BBD) of the 
University of Minnesota (Gao et al., 2010), an online web ser- 
vice that listed known degradation pathways for hundreds of 
chemicals/contaminants. Within this chimney metagenome, a 
sequence-similarity based search against BBD was conducted to 
identify candidate genes that were involved in the biodegrada- 
tion of certain hydrocarbons. Sequences of previously reported 
anaerobic alkane degradation genes (Callaghan et al., 2010), ben- 
zylsuccinate synthase (foss) and alkylsuccinate synthase (ass), were 
retrieved from GenBank (accession no.: DQ826035, DQ826036, 
AJ001848, AB066263, AY032676, and AF113168) and searched 
against our Guaymas metagenome. 
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CLUSTERING ANALYSIS OF FUNCTIONAL CATEGORIES 

Clustering of functional categories (with KEGG annotation, 
e-value < le-5, min. identity of 30% and min. align, length of 15 
a.a.) was conducted in MG-RAST (using ward with Canberra dis- 
tance metric based on normalized values) among metagenomes 
as following: Guaymas Basin chimney 4558-6 (MG-RAST ID: 
4510962.3), Juan de Fuca chimney (MG-RAST ID: 4510965.3) 
(Xie et al., 2011), Lost City hydrothermal vent field (MG- 
RAST ID: 4461585.3) (Brazelton and Baross, 2009), two biofilm 
samples from acid mine drainage (UBA and 5way, with MG- 
RAST ID 4441137.3 and 4441138.3, respectively) (Tyson et al., 
2004), a gutless worm (MG-RAST ID: 4441115.3) (Woyke et al., 
2006), the North Pacific Subtropical Gyre microbial commu- 
nities (the HOT project with depth of 10, 70, 130, 200, 500, 
770 and 4000 m, and with MG-RAST ID: 4441051.3, 4441057.4, 
4441055.3, 4441041.3, 4441057.3, 4441062.3, and 4441056.3, 
respectively) (Delong et al., 2006) and a deeply buried sediments 
from Peru Margin (for sample original, amlmbsf, aml6mbsf, 
am32mbsf and am50mbsf, and with MG-RAST ID: 4440960.3, 
4440961.3, 4440973.3, 4459940.3, and 4459941.3, respectively) 
(Biddle et al, 2008). All metagenomes were stored in MG-RAST 
database. 

RESULTS 

SEQUENCING SUMMARY AND COVERAGE 

Initially, as shown in Table 1, a total amount of 512,830 reads 
and 196,377,880 bp of sequence data were generated by 454 
pyrosequencing. After removing low-quality reads and techni- 
cal duplicates, the remaining 504,915 reads (with an average 
length 383 bp) were assembled into 49,055 contigs (totaling 
26,241,624 bp, with an average length of 543 bp). 187,308 sin- 
gletons with an average length of 367 bp could not be assem- 
bled. From this assembly, 52,366 gene features were predicted, 
37,372 of which (71.4%) were with known annotations. A rar- 
efaction analysis of the final assembly was conducted based on 
the taxonomic information retrieved from annotation results in 
MG-RAST (Figure 1), which indicated that a reasonable num- 
ber of individual genomes were sampled and covered in the 
metagenome. 

TAXONOMIC DIVERSITY BASED ON 16S rRNA GENE PREDICTION 

70 and 90 16S rRNA sequences were predicted from webMGA 
(Wu et al, 201 1), with the use of HMM and BLASTN, respectively 
(Table 2). In both of the results, around 3/4 (72.9 and 76.7%, 



Table 1 | Summary of sequences from the chimney sample 4558-6. 



Properties Value 



Clean reads 504,915 

Base pairs 193,336,182 

GC content 42% 

Average read length 383 

Singletons 187,308 

Contigs 49,055 

Average contig length 543 

Best hit gene features with NR database 37372 



respectively) of all identified 16S rRNA sequences were from bac- 
teria. Deltaproteobacteria and Euryarchaeota had dominated the 
bacteria and archaea, respectively. 

PHYL0GENY FROM 454 PYROSEQUENCING DATA 

A total number of 37,372 predicted gene features were uti- 
lized to extract taxonomic information by BLASTing against 
the NCBI NR database. In total, 21,792 (58.3%) and 11,249 
(30.1%) hits were assigned to bacteria and archaea (rules for 
taxonomic assignment were stated in "Materials and Methods"), 
respectively (Table Al). At the phylum level, Proteobacteria 
(9558, 25.6%), Euryarchaeota (9078, 24.3%), Crenarchaeota 
(1354, 3.6%), Firmicutes (786, 2.1%), Thermotogae (725, 1.9%), 
and Bacteroidetes (607, 1.6%) had the top matches (Figure 2). 
Taxa with most hits assigned were listed in Table 3. Notably, 
the majority of sequences in this chimney metagenome orig- 
inated from the families Archaeoglobaceae, Thermococcaceae, 
and Desulfobacteraceae, and a high proportion (>21.2%) of 
sequences potentially originated from sulfate reducing prokary- 
otes (SRP). Moreover, thermophilic sulfate reducing microorgan- 
isms, including bacteria (Thermodesulfobacteriaceae) and archaea 
(Archaeoglobaceae) were found predominant, highlighting their 
roles in reducing sulfates and/or sulfites to sulfides during energy 
metabolism under anaerobic conditions. 

FUNCTIONAL CATEGORY HITS DISTRIBUTION 

Major KEGG function categories were listed (Table 4) and 
ordered by the number of unique hits assigned to each category. 
Similar to a previous study on the chimney from hydrother- 
mal vents (Xie et al., 2011), genes involved in Recombination 
and Repair were among the abundant categories (Table 4). In 
addition, transposaes were found to be highly enriched in this 
sample (Table A3), and genes participated in Chemotaxis and 
Flagellar Assembly was all identified with high abundance in this 
metagenome. 

DEGRADATION OF REFRACTORY 0M AND PETROLEUM 
HYDROCARBONS 

Sequences coding for the complete degradation pathway of cel- 
lulose and fatty acids, as well as key enzymes involved in 
breakdown of lipid and pullulan were all identified in this 
metagenome (Table A2). Many of these identified enzymes were 
with thermorphilic-origin best hits (as marked with asterisk in 
Table A2). Peptidases were found to be of low abundance in 
this metagenome (data not shown). For anaerobic hydrocarbon 
degradation, sequences coding for benzylsuccinate synthase (bss) 
and alkylsuccinate synthase (ass), key enzymes in the fumarate 
addition pathway for the anaerobic oxidation of hydrocarbons 
(Callaghan et al, 2010), were identified (Table 5). In particu- 
lar, gene candidates involved in BTEX (benzene, toluene, ethyl- 
benzene, and xylenes) degradation were identified by searching 
against the BBD of the University of Minnesota (Gao et al., 2010) 
(Table 5). Additionally, to show the metabolic potential of this 
Guaymas sample in the degradation and remediation of organic 
contaminants (He et al, 2010), genes involved in the degra- 
dation of aromatic carboxylic acid (benzoate, phenylpropionate 
and phthalate), chlorinated aromatics (2- and 4-chlorobenzoate, 
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FIGURE 1 | Rarefaction plot of the total number of distinct taxon annotations (MG-RAST) as a function of the number of sequences from the 
assembly. 



Table 2 | Taxonomic diversity based on 16S rRIMA gene prediction 
from the metagenome. 

Methods Domain [total no. Diversity (% in domain) 
of sequences (%)] 

HMM Bacteria (51, 72.9) Deltaproteobacteria (25.4) 

Gammaproteobacteria (21.6) 
Epsilonproteobacteria (13.7) 
Deferribacteres (7.8) 

Archaea (19, 27.1 ) Euryarchaeota (63.2) 

Crenarchaeota/Thermoprotei (36.8) 

BLASTN Bacteria (69, 76.7) Deltaproteobacteria (23.4) 

Gammaproteobacteria (18.8) 
Epsilonproteobacteria (12.5) 
Acidobacteria (9.4) 
Deferribacteres (7.8) 

Archaea (21 , 23.3) Euryarchaeota (52.4) 

Crenarchaeota/Thermoprotei (47.6) 



2,4,5-trichlorophenoxyacetic acid), heterocyclic aromatics (car- 
bazole and dibenzothiophene), nitroaromatics (nitrobenzene and 
nitrophenol) as well as a few of other hydrocarbons (cyclohex- 
ane and tetrahydrofuran) were searched within the Guaymas 
metagenome. Notably, this chimney sample seemed to have 
the potential to degrade toluene, ethylbenzene and o-xylene 
(Table 5), yet no such evidence for benzene or the rest has been 
detected. 



COMPARATIVE METAGEN0MIC ANALYSIS 

Clustering on (KEGG) functional gene categories was con- 
ducted among different environmental samples (Figure 3). 
Guaymas chimney sample 4558-6 (4510962.3) was clustered 
most closely with chimney sample from Juan de Fuca 
(4510965.3). Both of these two metagenomes were almost 
depleted in categories of RNA family and folding, sorting 
and degradation, while they showed higher abundance in 
the categories of signaling molecules, and interaction and 
cell communication. These features might be highly related 
to the specific environmental conditions where these two 
chimney samples were collected. The metagenome of the 
Lost City chimney sample (4461585.3) was not grouped 
with the above two chimney samples. The two biofilm sam- 
ples from acid mine drainage (4441137.3 and 4441138.3) 
shared similar functional profiles in the heatmap, adjacent 
to the chimney sample from Lost City and the gutless 
worm sample (4441115.3). Samples from the North Pacific 
Subtropical Gyre microbial communities (4441051.3, 4441057.4, 
4441055.3, 4441041.3, 4441057.3, 4441062.3, and 4441056.3) 
were grouped next to each other in the functional heatmap, 
and a similar pattern was observed for Peru Margin sediments 
samples (4440960.3, 4440961.3, 4440973.3, 4459940.3, and 
4459941.3). Enzymes involved in hydrocarbon biodegradation 
in metagenomes from different environments were listed in 
Table A3. When compared to metagenomes from different envi- 
ronments, 4458-6 from Guaymas hydrothermal vent chimney 
was the only one with enzymes for biodegradation of toluene, 
ethylbenzene, and o-xylene (Table A3), highlighting its metabolic 
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FIGURE 2 | Taxonomic information at the phylum level with number of sequences in each phylum, based on BLASTX (1e 5 ) results. 



Table 3 | Top families with the most hits assigned. 



Family 


No. of hits 


Phylum 




assigned (%) 




Archaeoglobaceae 


4777 (12.78) 


Euryarchaeota 


Thermococcaceae 


2365 (6.33) 


Euryarchaeota 


Desulfobacteraceae 


2117 (5.66) 


(Delta) Proteobacteria 


Thermotogaceae 


723 (1.93) 


Proteobacteria 


Desulfurococcaceae 


672 (1.80) 


Crenarchaeota 


Helicobacteraceae 


375 (1.00) 


(Epsilon) Proteobacteria 


Thermodesulfobacteriaceae 


355 (0.95) 


Proteobacteria 


Colwelliaceae 


344 (0.92) 


(Gamma) Proteobacteria 


Thiotrichaceae 


214(0.57) 


(Gamma) Proteobacteria 


Desulfovibrionaceae 


208 (0.56) 


(Delta) Proteobacteria 


Sulfate reducing prokaryotes 


7921* (21.2) 





The exact number of blast hits was listed for each family, with the percentages 
(out of total hits) listed in the parentheses. 

"The number of hits for SRP was generated by summing all the 
hits from Archaeoglobaceae, Desulfobacteraceae, Desulfurococcaceae, 
Desulfovibrionaceae, and Thermodesulfobacteriaceae. 



potential in degrading hydrocarbons in the native oil-immersed 
condition. 

DISCUSSION 

Advances in sequencing technologies have made microbial diver- 
sity studies easier and more accurate. However, biases were 



introduced during sequencing and analyzing of environmental 
sequences. For instance, biases were generated when multiple 
reads were produced for a unique DNA fragment in a random 
manner. Such biases might result in an inaccurate representation 
of the fragments and lead to misleading conclusions. Therefore, 
strict quality control of the sequenced reads should be done. 
In this study, a decent sequencing coverage has been reached 
(Figure 1) and we are in the position to investigate the taxo- 
nomic diversity as well as the metabolic potential of this Guaymas 
chimney sample. 

DIVERSITY AT DIFFERENT TAXONOMIC LEVELS 

Estimated from both 16S rRNA sequences as well as taxonomic 
classifications based on blast hits, the proportion of bacteria in 
the community was about 60-75%, and 25-30% for archaea. 
Bacteria were dominated by Proteobacteria, while Euryarchaeota 
were most abundant among the archaea. So far, sulfate-reducing 
bacteria and archaea have been isolated in vent chimneys, with 
either high or low growth temperature (30-90°C) (Burggraf et al., 
1990; Audiffrin et al, 2003; Moussard et al., 2004). Sulfate reduc- 
tion may occur at temperatures up to 110°C in hot sediments 
from Guaymas hydrothermal field (Jorgensen et al., 1992). In 
this study, a high proportion (at least 21.2%) of sequences from 
this chimney metagenome was potentially originated from SRP 
(Table 3), a large and extremely diverse physiological group of 
anaerobic microorganisms. Sulfate-reducing bacteria and archaea 
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were capable of degrading a wide range of organic substrates 
(Widdel and Bak, 1992; Widdel and Rabus, 2001), including 
petroleum-based products that were discussed in this study. The 
presence of large numbers of SRP in the oil-immersed chim- 
ney suggested that the microbial community had the poten- 
tial of hydrocarbon biodegradation, which were likely to be 
coupled with sulfate reduction. Besides, most of the retrieved 
sequences were estimated to originate from thermophilc microor- 
ganisms such as Archaeoglobus, heterotrophic Thermo coccales and 
Thermodesulfobacteriaceae, reflecting the influence of the high 
temperature on the structure of the microbial community. 



Table 4 | List of major KEGG families. 



KEGG (level 2) 


Avg. % ident 


Avg. aligned 


No. of hits 


annotation 




len. 




Amino acid metabolism 


69.68 


80.99 


1756 


Carbohydrate metabolism 


69.96 


75.17 


1365 


Translation 


71.01 


66.33 


1301 


Membrane transport 


68.01 


70.23 


836 


Metabolism of cofactors 


68.73 


71.72 


723 


and vitamins 








Energy metabolism 


72.02 


73.84 


689 


Nucleotide metabolism 


68.99 


68.01 


550 


Replication and repair 


67.7 


71.41 


515 


Folding, sorting and 


71.9 


71.82 


386 


degradation 








Signal transduction 


67.86 


62.85 


375 


Cell motility 


70.33 


57.04 


258 


Glycan biosynthesis and 


67 


65.77 


240 


metabolism 








Transcription 


73.36 


75.02 


213 


Cell growth and death 


68.78 


64.14 


209 


Lipid metabolism 


70.28 


64.61 


184 



Average sequence identity (Avg % ident) and aligned sequence length (Avg 
aligned len) were presented. No. of Hits represented the number of unique hits 
within each functional protein category. Uncharacterized proteins were excluded. 



METABOLIC POTENTIAL FOR HYDROCARBON DEGRADATION 

Deep-sea environments have been characterized by the lack of 
easily biodegradable OM, thus genes related to the degrada- 
tion of refractory OM have been extensively recovered from the 
metagenomes of deep oceans ( Martin -Cuadrado et al, 2007). 
Moreover, bacteria isolated from the deep sea have been shown 
to be capable of degrading refractory OM such as chitin and 
cellulose (Hedges et al, 2000; Vezzi et al, 2005; Wang et al, 
2008). Here, the potential of a chimney microbial commu- 
nity for refractory OM degradation was evaluated. Genes cod- 
ing for chitin degradation were not found in the metagenome, 
while all the genes involved in the degradation of cellulose 
and fatty acids, as well as a few key enzymes in the break- 
down of lipid and pullulan were identified, which probably 
reflected in part the input of terrestrial OM circulating in this 
Guaymas vent field. As most of the identified enzymes had 
a presumably thermophilic origin (Table 5 and Table A2), and 
thermophilic microorganisms were predominant in the chimney 
sample (Table 3), it was likely that these enzymes had some degree 
of heat tolerance. Additionally, microorganisms of this Guaymas 
chimney were predicted to have the potential to degrade toluene, 
ethylbenzene, and o-xylene. Notably, anaerobic hydrocarbon 
degrading microorganisms have been successfully enriched and 
most extensively studied with the benzene-toluene-ethylbenzene- 
xylenes (BTEX) group of petroleum hydrocarbons (Stockton 
et al., 2009). For example, pure culture strain EbS7 was iso- 
lated from the sediments of Guaymas Basin which was reported 
with the ethylbenzene-dependent sulfate reduction (Kniemeyer 
et al., 2003). This Guaymas chimney sample was oil immersed, 
thus it was not surprising to see the presence of genes involved 
in the degradations of petroleum hydrocarbons (Table 5). Our 
data further highlighted the potential of this microbial com- 
munity using the fumarate addition pathway for the degrada- 
tion of aromatic and aliphatic hydrocarbons. Identification of 
genes coding for hydrocarbon degradation would advance char- 
acterizations of the potential source of electrons and energy, 
as well as the roles of this chimney microbial community had 
played in its native environment. Notably, this Guaymas chimney 



Table 5 | List of annotated gene features involved in degradation of hydrocarbons, absense of gene was marked with "-" 



Substrate 


Enzyme (EC) 


Best hit contig 


Metagenome best hit organism 


Similarity % (e-value) 


n-Alkanes 


assA1 


Contig00511 


Desulfoglaeba alkanexedens AK-01 


78 (5.2e-27) 




assA2 


Contig45077 


Desulfoglaeba alkanexedens AK-01 


95 (9e-10) 


Aromatic hydrocarbons 


bssA 


Contig28667 


Desulfotomaculum sp. 0x39 


76 (1.7e-6) 




bssB 










bssC 


Contig46590 


Desulfatibacillum alkenivorans AK-01 


35 (2e-7) 




bssD 


Contig29655 


Desulfatibacillum alkenivorans AK-01 


67 (1e-35) 




bssE 


Contig46456 


Azoarcus sp. 


72 (1e-6) 




bssG 


Contig46590 


Desulfatibacillum alkenivorans AK-01 


79 (1e-6) 


Toluene 


toluene 4-monooxygenase 


Contig31764 


Dechloromonas aromatica (strain RGB) 


77 (1.6e-64) 


Ethylbenzene 


ethylbenzene dehydrogenase 


Contig32722 


Desulfococcus oleovorans Hxd3 


86 (3.4e-145) 


o-Xylene 


o-xylene monooxygenase 


Contig31764 


Dechloromonas aromatica (strain RGB) 


77 (1.6e-64) 
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FIGURE 3 | The data was compared to KEGG database using a 
maximum e-value of 1e~ 5 , a minimum identity of 30%. The 

heatmap was clustered using ward clustering with Canberra distance 



metric and grouped at level 2 of KEGG annotation. The shading is 
proportional to the normalized values (0-1) calculated on the functional 
enrichment of each category per sample. 



metagenome seemed to be the only one (among all the sam- 
ples included in the comparative metagenomic analysis) with 
metabolic potential for hydrocarbons biodegradation (Table A3). 
Enzymes activated by oxygen (namely active under aerobic con- 
ditions) were not identified in our metagenome, consistent with 
the strict anaerobic condition where this chimney sample was 
collected. Metagenomic analysis suggested that degradation of a 
variety of petroleum hydrocarbons by SRP might play an impor- 
tant role in the energy metabolism of this chimney microbial 
community. 

COMPARISON AMONG DIFFERENT SAMPLES 

For the moment, only three metagenomes from hydrothermal 
vent chimney are available, one from Juan de Fuca (Xie et al., 
2011), one from Lost City (Brazelton and Baross, 2009) and the 
last one was our chimney sample 4458-6 from Guaymas Basin. 
The whole metagenome-based comparison (Figure 3) showed 
that Lost City was not clustered next to the other two chim- 
ney samples, which could be due to the fact that the Lost City 
sample was collected from a white carbonate chimney (CC) 
rather than a back sulfide chimney (SC) as the other two sam- 
ples were. When considering the fact that the dominant energy 
sources for SC and CC were significant different (i.e., metal sul- 
fides in SC and reduced volatiles such as hydrogen, methane 
in CC, respectively), the genomic differences between the two 
types of chimneys become clear as different energy metabolisms 
were promoted accordingly. Moreover, the venting fluid of Lost 
City had a pH of 9-11 and temperatures lower than 90° C, 
whereas that of Juan de Fuca sample was acidic and hot (tem- 
perature around 310°C); the vent fluid of our Guaymas chimney 



sample represented an intermediate temperature regime (190°C). 
Differences in environmental factors (such as temperature, pH, 
and physico-chemical gradients) between white and black chim- 
neys could also have a significant influence on the distribution 
of functional gene categories, and as a result the three chim- 
ney samples were not clustered next to each other. On the other 
hand, when looking at specific gene categories, metagenomes 
from hydrothermal vent chimneys (Guaymas Basin, Juan de 
Fuca and Lost City) kept a larger collection of genes involved 
in recombination and repair, chemotaxis and flagellar assem- 
bly, as well as transposases. This could be a chimney- specific 
feature, reflecting adaptations of microorganisms and the micro- 
bial community to the extreme fluctuating chemical and phys- 
ical conditions that characterize deep-sea hydrothermal vent 
chimneys. 
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APPENDIX 




FIGURE Al | The photo of a black-smoker chimney sample (4558-6), 
collected in Guaymas Basin (27°0.9'N, 111°24.6'W, depth = 2013 m). 



Table A1 | Taxonomic information based on assigned blastx results to 
each domain. 



Domain 


No. of hits assigned 


% of hits assigned 


Bacteria 


21792 


58.3 


Archaea 


11249 


30.1 


Eukaryota 


224 


0.6 


Virus 


88 


0.2 


Unassigned 


4019 


10.8 



Frontiers in Microbiology | Extreme Microbiology 



June 2013 | Volume 4 | Article 148 | 10 



He et al. 


Metagenome of a Guaymas chimney sample 


Table A2 | List of annotated protein features involved in degradation processes. 


Substrate: step enzyme 


Best BLAST hit organism 




STEP [CELLULOSE]: ->• CELLULOSE CELLOBIOSE 


Endo-1 ,4-beta-glucanase 


Thermotoga neapolitana* 


Cellulase 


Thermosipho melanesiensis BI429* 




Methanococcoides burtonii DSM 6242 


Endoglucanase 


Archaeoglobus fulgidus DSM 4304* 


Cellobiose beta-D-Glucose 


Beta-glucosidase 


Dictyoglomus thermophilum H-6-12 (2) 


STEP [FATTY ACIDS]: FATTY ACIDS -+ HEXA-DECANOYL-COA 


Long chain fatty acid CoA ligase 


Bacillus thuringiensis serovar huazhongensis BGSC 4BD 1 




Bacteroides sp.2_1_7 




Bacillus cereus BURD-S I 196 




Desulfobacterium autotrophicum HRM2I2) 




Carboxydothermus hydrogenoformans Z-2901* 




Archaeoglobus fulgidus DSM 4304 (16)* 


A/^v/ PnA ci/nthotaco 
MCyl LUM byl I U Ic LdbfcJ 


ilUI UfJ! IIIUzi LUI 1 lUUci LJyDtVI Z? / Z/U 




^wntrnnht i<z Afiriitrnnhici i^ 




Desulfatibacillum alkenivorans AK-01 




Archaeoglobus fulgidus DSM 4304 (6)* 


Hexa-decanoyl-CoA — Trans-hexadec-2-enoyl-CoA 




Acyl-CoA dehydrogenase 


Pseudomonas aeruginosa PA 7 




Marinobacter algicola DG893 




Colwellia psychrerythraea 34H 




Oceanobacter sp. RED65 




Oceanospirillum sp. MED92 




Gamma proteobacterium N0R51-B 




Brucella abortus bv. 1 str. 9-94 1 




Labrenzia alexandrii DFL-11 




Brucella suis bv. 5 str. 513 




Cytophaga hutchinsonii ATCC 33406 




Candidatus Cloacamonas acidaminovorans 




Bacillus clausii KSM-K16 




Archaeoglobus fulgidus DSM 4304 (16)* 


Trans-hexadec-2-enoyl-CoA (S)-3-Hydroxy-hexa-decanoyl-CoA 




Beta-hydroxyacyl-CoA dehydrase 


Desulfuromonas acetoxidans DSM 684 




Desulfococcus oleovorans Hxd3 (4) 


Enoyl-CoA dydratase 


Burkholderia multivorans CGD2M 




Acidaminococcus sp. D21 




Thermosinus carboxydivorans Nor1 




Desulfococcus oleovorans Hxd3 (4) 




Desulfatibacillum alkenivorans AK-01 




Geobacter bemidjiensis Bern 




Candidatus Koribacter versatilis Ellin345 




Marinomonas sp. MWYL 1 




Colwellia psychrerythraea 34H 




Carboxydothermus hydrogenoformans Z-2901* 




Archaeoglobus fulgidus DSM 4304* 




Rubrobacter xylanophilus DSM 9941 (2)* 


(Continued) 
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Table A2 | Continued 


Substrate: step enzyme 


Best BLAST hit organism 


(S)-3-Hydroxy-hexa-decanoyl-CoA -* 3-Oxo-hexa-decanoyl-CoA 




3-Hydroxyacyl-CoA dehydrogenase 


Gallionella ferruginea ES-2 




Desulfococcus oleovorans Hxd3 




Xanthobacter autotrophicus Py2 




Rhodospirillum centenum SW 




Geobacter uraniireducens Rf4 




Arcnaeogiouus ruigious uoivi 4JUh f iz) 


Beta-hydroxyacyl dehydrogenase 


Desulfococcus oleovorans Hxd3 (4) 




Desulfuromonas acetoxidans DSM 684 


3-Ketoacyl-CoA thiolase 


Colwellia psychrerythraea 34H 




Halobacterium sp. NRC- 1 




Anaeromyxobacter dehalogenans 2CP-C 




Archaeoglobus fulgidus DSM 4304 (6)* 


Thiolase 


Geobacter metallireducens GS-15 




Polaromonas sp. JS666 






Pullulanase 


Thermococcus onnurineus NA 1 (3)* 




Thermococcus hydrothermalis* 




Thermotoga sp. EMP 


Amylopullulanase 


Thermococcus sp. AM4* 




Glycerate kinase 


Thermoanaerobacter sp. X513* 


Aldehyde dehydrogenase 


Desulfitobacterium dichloroeliminans LMG P-21439 




Rhodothermus marinus SG0.5JP17-172 




Melioribacter roseus P3M 




Thermoplasma volcanium GSS1* 




Calditerrivibrio nitroreducens DSM 19672 




Uncultured Acidobacteria bacterium 


Glycerol dehydrogenase 


Archaeoglobus fulgidus DSM 4304* 



Thermalphilic species were marked with an asterisk. 
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Table A3 | List of enzymes involved in hydrocarbon biodegradation in metagenomes from different environments, presence was marked with 
"+," while absence with "— ." 







Transposase 




Toluene 


Ethyl benzene 


o-Xylene 




No. of hits 


% 


Presence 


Presence 


Presence 


4558-6 


o I o 

O I O 




1.32 


i 


i 


_i_ 


h ian Hp Fi ira 


1 CO 

I Do 




1.10 








Lost City 


I OO / 




9.64 








Acid mins drainags 


1 1 Q 




2.27 








Rwav arirl minp 
■jvvcjy auivj mimics 














Wnrm 

V VUI 1 1 1 


zuu 




2.80 








Pp n i nri 1 






0.49 








Pp n i amn1 
i c i li ci i i i \ j i 


10 




0.18 








Pp n i amn1 fi 

1 CI U Q 1 1 1 YJ 1 \J 


6 




0.10 








Ppni amn?9 

1 C 1 U a 1 1 \\Jkj£. 


12 




0.11 








Peru_amp50* 


6 




0.08 





+ 





HOT_10 


1 




0.03 








HOT_70 


3 




0.08 








HOT_130 


2 




0.08 








HOT_200 


3 




0.09 








HOT_500 


13 




0.32 








HOT_770 


15 




0.31 








HOT_4000 


49 




1.03 









'Hit property: e-value 4.0e-12, similarity 50%, ethylbenzene dehydrogenase alpha subunit. 
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